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Abstract 

In this paper a fuzzy clustering model for fuzzy data with outliers is proposed. The model 
is based on Wasserstein distance between interval valued data which is generalized to fuzzy 
data. In addition, Keller's approach is used to identify outliers and reduce their influences. 
We have also defined a transformation to change our distance to the Euclidean distance. With 
the help of this approach, the problem of fuzzy clustering of fuzzy data is reduced to fuzzy 
clustering of crisp data. In order to show the performance of the proposed clustering algorithm, 
two simulation experiments are discussed. 

keywords: fuzzy clustering, fuzzy data, Wasserstein distance, outliers 
1 Introduction 

Clustering is a division of a given set of objects into subgroups or clusters, so that 
objects in the same cluster are as similar as possible, and objects in different clusters 
are as dissimilar as possible. From a machine learning perspective, clustering is 
an unsupervised learning of a hidden data concept (Berkhin [3J). In conventional 
(hard) clustering analysis, each datum belongs to exactly one cluster, whereas in 
fuzzy clustering, data points can belong to more than one cluster, and associated 
with each datum is a set of membership degrees. 

Fuzzy data are imprecise data obtained from measurements, human judgements 
or linguistic assessments. In cluster analysis, when there is simultaneous uncertainty 
in both the partition and data, a fuzzy clustering model for fuzzy data should be 
applied (D'Urso and Giordani [TTj). 

In recent literature, there are several works regarding the fuzzy clustering of fuzzy 
data. Hathaway et al. [18J and Pedrycz et al. [2H] introduced models that convert 
parametric or non-parametric linguistic variables into generalized coordinates before 
performing fuzzy c-means clustering. Yang and Ko [35] presented a fuzzy k-numbers 
clustering model that uses a squared distance between each pair of fuzzy numbers. 
Yang and Liu [38] extended the Yang and Ko work and proposed a fuzzy k-means 
clustering model for conical fuzzy vectors. Yang et al. [36] proposed a fuzzy K- 
means clustering model for handling both symbolic and fuzzy data. Hung and Yang 
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[20] proposed an alternative fuzzy k-numbers clustering model which is based on 
exponential-type distance measure. D'Urso and Giordani [11] proposed a weighted 
fuzzy c-means clustering model which considers fuzzy data with a symmetric LR 
membership function. 

In this paper, we first propose a new distance measure for comparison of fuzzy 
data. On account of the fact that all the a-cuts of fuzzy data are intervals, we obtain 
the distance between two fuzzy data from the distances between their a-cuts. To 
this purpose, a special case of Wasserstein distance is utilized. The choice of a-cuts 
is motivated by the fact that, fuzzy data with different shapes can be used. After 
introducing our distance, we use it for fuzzy clustering of fuzzy data. Moreover, 
with the help of Keller's [22] approach, an additional weighting factor is added 
for each datum to identify outliers and reduce their effects. In other approach, by 
definition of a transformation, triangular fuzzy data are changed to crisp data. With 
this novel approch, after applying the transformation, any fuzzy clustering model 
for crisp data can be used. Furthermore, for determining the optimal number of 
clusters, there is no need to define a cluster validity index for fuzzy data. The ones 
existing in literature for crisp data can be applied. 

The rest of the paper is organized as follows. In Section 2, the concept of LR- 
type fuzzy data is introduced. Some related works regarding metrics for fuzzy data 
are reviewed in Section 3. We propose a distance measure for fuzzy data based on 
Wasserstein Metric in Section 4; by means of this distance and following Keller's ap- 
proach, we propose a fuzzy clustering model for fuzzy data with outliers. Then, by 
defining of a new transformation, we change the fuzzy data to crisp data and for the 
sake of comparison, we again use Keller's algorithm (Section 5). Successively, in Sec- 
tion 6, the results of two simulation experiments are discussed. Finally, conclusions 
and future works are presented in Section 7. 



2 LR-type fuzzy data 

The LR-type fuzzy data represent a general class of fuzzy data. When we are dealing 
with univariate LR fuzzy data, this kind of data can be shown by a vector of LR- 
fuzzy numbers. In the more general case of multivariate analysis, we have a matrix of 
LR-fuzzy numbers (De 01iveria,Pedrycz [9]). To be more specific, let L (and R) be a 
decreasing shape function, which map R + — > [0, 1] with L(0) = 1; L(x) < 1, Vx > 0; 
L(x) > 0,Vx < 1; L(l) = or (L(x) > 0,Vx and L(+oo) = 0) (Zimmerman, [32] ). 
Then, a fuzzy number A is of LR-type if for c, I > 0, r > in R, 

M x > ~ \r(*=il) forx>c. { ) 

where, c, I, r are the center, left and right spreads of A, respectively. Symbolically 
we can write A = (c, /, t)lr. 



2 



In LR-type fuzzy numbers, the triangular fuzzy numbers (TFNs) are most com- 
monly used. An LR-type fuzzy number A is called triangular fuzzy number if 
L(x) = R(x) = 1 — x, characterized by the following membership function: 



l- c -f for x < c, 
1 — for x > c. 

r — 



(2) 



3 Related works 

In the recent literature, there are some distance measures for fuzzy data. We review 
some of them in this section. 

Definition: Considering two crisp sets A,BC. and a distance d(x,y) where, 
x G A and y G B, the Hausdorff distance is defined as follows: 



(3) 



dn{A, B) = max < sup inf d(x, y), sup inf d(x, y) > . 



According to the concept of a-cuts, the Hausdorff metric du can be generalized to 
fuzzy numbers F, G, where F(or G)iM. — > [0, 1]: 



d p (F,G) 



(dii(F a , G a )) p d a 
sup d H (F a ,G a ) 

ae[0,l] 



1/p 



if p G [1, oo) 
if p = oo, 



(4) 



where, the crisp set F a = {x G M. k : F(x) > a}, a G [0, 1], is called the a-cut of F 
(Nather,[26]). 

J proposed the following distance between two intervals: 



Tran and Duckstein 
d TD (A,B) = 



a + b 



+ x{b — a) 



u + v 
2 

a + b 



+ y(v-u) 



u + v 



dx dy 
1 

+ 3 



b — a 



v — u 



(5) 



Then, they used it to formulate their distance measure for fuzzy numbers, but g?tl> 
does not satisfy the reflexivity property (Irpino and Verde [21]): 



d TD (A,A) 



a + b\ /a + b 



1 2 



b — a 



b — a 



2 fb-a 



> 0. 



(6) 
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A squared Euclidean distance between a pair of LR-type fuzzy data A\ = (ci, h,ri) 
and A 2 = (c 2 ,h, ^2) , where c denotes the center and Z,r indicate, respectively, the 
left and right spread, is defined by Yang and Ko [35] : 

d 2 YK {\, p) = (ci - c 2 f + [(ci - A/i) - (ca - A/ 2 )f + [(ci + pri) - (c 2 + pr 2 )] 2 , (7) 

where A = f Q L^it) dt, p = f Rr l {t)dt are parameters that summarize the shape 
of the left and right tails of the membership function and L, R are decreasing shape 
functions which were defined in Section 2. 

4 The proposed distance for fuzzy data 

In this section, we first present a new distance measure for interval-valued data, and 
then it is used to formulate the distance measure for fuzzy data. Let U = [a«, be 
an interval for i = 1,2. We can parameterize as follows: 

I l {t) = a i + t{b l -a i ) < t < 1. (8) 

If we represent Jj by means of its midpoint rrii = 2i±k anc [ radius 5i = bi ~ ai , Eq.8 
can be rewritten as follows: 

Ii(t) =m t + (2t - l)8i < t < 1. (9) 

The distance measure between 1% and J 2 can be defined as follows: 

d 2 (I 1 ,I 2 )= I \h{t) - I 2 {t)f dt 
Jo 

= [ {(m 1 -m 2 ) + (5 1 -S 2 )(2t-l)] 2 dt 
Jo 

= (m 1 -m 2 ) 2 + i(5 1 -5 2 ) 2 . (10) 

This distance takes into account all the points in both intervals. Irpino and Verde 
[21] has derived Eq.10 from another point of view, using the Wasserstein distance. 
To be more specific, let F\ and F 2 be distribution functions, the Wasserstein L 2 
metric is defined as follows (Gibbs and Su [E]): 

Ui >| 1/2 

{F{\t)-F 2 -\t)fdt\ , (11) 

where F^ 1 and F 2 ~ l are the quantile functions of the two distributions. If we assume 
Fi for i — 1, 2 to be the uniform distribution function on [a*, fej ], then i^ _1 (t) is the 
same as the parametric representation in Eq.8. Thus, the Wasserstein distance 
coincides with the distance defined in Eq.10. 
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Now we are ready to construct a distance between fuzzy data. According to ex- 
eats, the Wasserstein distance dw ass can be generalized to fuzzy numbers A\ and 
A 2 : 



d(A u A 2 ) = {j^ d^HA^iA^daj . (12) 

We calculate this distance for triangular fuzzy numbers. Let Ai = (q, k, r$), i — 1, 2 
be triangular fuzzy numbers and (Ai) a = [ZjO!+(cj— — rjO!+(cj+rj)], the midpoint 
and the radius of (Ai) a are as follows: 

m (A,) a ^Ci + -(l-a)(ri-k). (13) 



Then we have: 

d 2 (A u A 2 ) = [ d 2 Wass ((A 1 ) a ,(A 2 ) a )da 
Jo 

= I [l m (A l)a ~ m (A 2 )f + l[ S (A l)a ~ 5 (A 2 )f] da 

= f { ((ci - c 2 ) + 1(1 - a) [(r a - r 2 ) - (/ x - / 2 )] ) ' 

+ ^(l-«) 2 [(r 1 -r 2 ) + (/ 1 -/ 2 )] 2 |rfa 

= (ci - c 2 ) 2 + J [(i a - l 2 ) 2 + (n - r 2 ) 2 - (i a - i 2 )( ri - r 2 

-^(^-^[^-^-(rx-r,)]. (15) 

We can use the distance (fT5]) to define a distance between any two vectors of fuzzy 
numbers, by considering the sum of squared distances between individual elements. 
[See equation fl20|) ahead for more details.] In the next section, this distance is used 
for fuzzy clustering of fuzzy data. 



5 Fuzzy clustering of fuzzy data with outliers 

In this section we propose two approaches. In the first approach, based on our 
distance, we propose a fuzzy clustering model for fuzzy data, by modifying Keller's 
algorithm [22] . In the second approach, by defining a transformation, we reduce the 
problem of fuzzy clustering of fuzzy data to fuzzy clustering of crisp data. With the 
help of the second approach, any fuzzy clutering algorithms for crisp data can be 
used for fuzzy clustering of fuzzy data. For the sake of comparison with the first 
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approach, we again use Keller's algorithm. Before describing the approaches, let us 
introduce some notations. 

Let U = {uik '■ i = 1, . . . , c; k = 1, . . . , n} be the c x n membership matrix, 
where c is the number of clusters, n the number of data vectors and un~ G [0, 1] 
the membership degree of the k-th object to the i-th cluster. We consider each data 
point, denoted as Xk, and each cluster prototype, denoted as v^, to be a p-dimensional 
vector of triangular fuzzy data. This is in contrast to the Keller's approach where 
data elements and cluster prototypes are crisp. 

To be more specific, let Xkj denote the j-ih component of Xk, the k-th data point. 
Then, be represented as a 3- vector collecting its center, left spread and right 

spread. In symbols, we have 

x kj ■= [cx kJ k kJ rz kJ ] T E M 3 , (16) 

x k := [x T kl x T k2 ••• 5fJ T GM 3p , (17) 

for k = 1, . . . , n. In other words, we may view each data point, Xk, either as a 
p-dimensional vector of fuzzy elements Xkj or as a 3p-dimensional vector of real 
numbers. Both viewpoints are helpful and will be used interchangeably in what 
follows. A similar representation will be used for cluster prototypes, t>j. That is, 

^■-k-fc,/?/^ 3 , (is) 

-?J T e#, (19) 

for i — 1, . . . , c. 

As mentioned earlier, we consider the following (squared) distance between fuzzy 
vectors Xk and v^, 

p 

d 2 (vi, x k ) d 2 (vij, x kj ), (20) 

3=1 

where, d 2 (yij,Xkj) is the (squared) distance (TL5]) between fuzzy numbers Vij and Xkj- 



5.1 Approach I 

Following Keller, we minimize the objective function: 

c n 1 

J(U, V;X) = J2J2<k-- q - d'iv^Xk). (21) 
subject to the constraints 

J2^k = co, (22) 

k=l 
c 

^ j u ik = l 1 (23) 



Tl 



i=l 



G 



where, m is the degree of fuzziness and d 2 (yi,Xk) is as defined in (EDI) . 

The factor uik represents the weight of the kth datum and u is a constant real 
valued parameter. According to Keller, the introduction of these weight factors 
helps in identifying outliers and reducing their effects. With constant parameter 
q, the influence of the outlier weight factors can be controlled. For this purpose, 
outliers are assigned a large weight cj/., so \ is small in this case. 

The necessary conditions for minimizing the objective function are as follows: 



E 

k=l 



ik 



UJ 



n 1 

z2< k ■ 



fc=l 



k=l 
1 

9< 



g ^^kj g x^^ij ^-^kj ) ^ ^^kj j 



9^ lk Lot 



n 1 



fc=l 



k=l 
1 

9 l 



E 



fc=i 



i 

9 + 1 



i ^ i=i 



9 + 1 



UJ. 



Uik 



E 



r=l 



d 2 (v r ,x k ) 



(24) 



(25) 



(26) 



(27) 



(28) 



As it is observed, the membership degrees are left unchanged, while the cluster 
centers take into account the weights; points with high representativeness are more 
effective than outliers. On the basis of the necessary conditions, we can construct 
an iterative algorithm as follows: 

Algorithm: 



Step 1. Fix the degree of fuzziness (m), the number of clusters (c), u and q. Choose 
an initial fuzzy c-partition £A '. Also, choose initial spreads and weights for each 
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datum subject to Eq. fl22l) . Set t=0. 

Step 2. Calculate V® = (4\l [ ~\r^) using U®, spreads, weights and Eqs.f^MB 
Step 3. Update ujf } k = 1, • • • ,n using Eq.([27D and update U {t) by f/ (m) using 
V {t) = (4 ) >$> r ®) and Eq.(l2HD 

Step 4. If — U®\\ < e , where e is a non-negative small number fixed by the 

researcher, the algorithm has converged. Otherwise, set t = t + 1 and go to step 2. 



5.2 Approach II 



This approach is based on a different view of the distance (1201) . With some linear 
algebra, one can reduce this distance to the usual 3p- dimensional Euclidean distance. 

For any iV-vector, say y = . . . , y^) G M. N , let ||y|| 2 := ( J2iLi Hi) ^ denote its 
Euclidean norm. 

Consider two triangular fuzzy numbers Ai = (q, U, j**), i = 1, 2. Letting c = c\— c 2 , 
I — 1% — I2, r — r% — r 2 and z = [c, Z, r] T , Eq. f[T5"j) can be rewritten as: 



1. 1,1 

— tr cl H — cr 

9 2 2 



(29) 



or equivalently as: 



d 2 (A 1 ,A 2 ) = [c, I, r] 



i_ 

18 



1 

\ 

,18 





c 




Z 




r 



(30) 



-12-, A 2 



Let us denote the matrix above as Q. The eigenvalues of Q are Ai 

Yg and A3 = r ~^* ■ Since Q is a real symmetric matrix, it is diagonalizable by 
orthogonal matrices. That is, there is an orthogonal 3x3 matrix U (whose columns 
are orthonormal eigenvectors of Q) for which we have 



Q = U 



Ai 
A 2 
A 3 



u 1 



(31) 



Let T be the (symmetric) square root of Q, i.e., 

T := Q 1 ' 2 = UA X / 2 U T . 

Then, we may write 

d{A u A 2 ) = ^z T Qz = ^z T Q 1 / 2 Q 1 / 2 z = Vz T T T Tz 



\Tz\ 



2- 



(32) 



(33) 
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Now, recalling definitions (ITS]) and (fTSl) of Xkj and consider the following 
transformations 

x kj := Tx kj , Vij := Tv^ (34) 

where x^j and tijj are treated as 3- vectors. Furthermore, let us stack {xkj}j and 
{vij}j into 3p- dimensional vectors as usual, i.e., 



x k :-- 



X 



I 



fcl 



a; 



t it 

kp 



] e M 3p , 



;2 



o 3p 



Combining ( )20|) . (|33|) and the definitions of Sfc and we obtain 

d(x k ,Vi) 



\ 3=1 



\ 



(35) 



Equation (j35j) shows that the distance between fuzzy vectors and is the same as 
the Euclidean distance between the transformed vectors x k and «j. In other words, 
we have reduced the problem of fuzzy clustering of fuzzy data to fuzzy clustering of 
crisp data. Thus, after applying transformations ( |34|) . any fuzzy clutering algorithm 
for crisp data can be used. 

In particular, we can directly apply Keller's algorithm to {xk} by minimizing the 
objective function 



c n 1 



\%k Vi\\2i 



(36) 



1=1 k=l 



under the same constraints (|22|) and (|23|) on {oj k } and {w^}. 
Necessary conditions for minimizing (1361) follows: 



E 

fc=i 



1 



(37) 



fc=i 



Vi, Xk 



Uik 



i=l 



9 + 1 



■ OJ. 



1=1 ^i=l 



9 + 1 



E 



<j 2 (^,£ fc ) 
^ V^ 2 (^V,^fc) 



(38) 



(39) 
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After iterations, equation (1371) provides cluster prototypes in the transformed do- 
main. To retrieve the fuzzy prototypes, one should apply the inverse transformation 
i.e. Vij = T -1 dij, for i = 1, . . . , c and j = 1, . . . ,p. 

6 Simulation experiments 

In order to show how well our method works, two simulation experiments are con- 
ducted; one in an environment without outliers and the other one in presence of 
outliers. We almost obtained the same results with both approaches. 

6.1 Clustering fuzzy data without outliers 

We now discuss the results of a simulation study carried out in order to compare 
the performance of our model with existing models able to handle fuzzy data. These 
models are proposed by D'Urso and Giordani [IT] , by Yang et al. [36], by Hathaway 
et al. [18] and by Yang and Liu |38j. In order to compare the models, 2160 fuzzy data 
sets were randomly generated. After running several models for different values of q 
and oj, we chose q = 1, u> = 200. The other parameters for clustering algorithm were 
set as follows: Number of objects (n = 10, 50, 100), number of variables (k = 2, 8, 16) 
and the weighting exponent (m = 2, 3). We constructed the data sets in such a way 
that c = 2 patterns can be found all over the simulation. To this purpose, the centers 
corresponding to the first n/2 objects were generated from the uniform distribution 
in [0,1], and those corresponding to the latter n/2 from the uniform distribution 
in [0 + 9, 1 + 9]. All the spreads were generated from the uniform distribution in 
[0, 1] (case a). On the other hand, in case 0, all the centers were generated from the 
uniform distribution in [0, 1], while the spreads corresponding to the first n/2 objects 
were generated from the uniform distribution in [0,1], and those corresponding to 
the latter n/2 from the uniform distribution in [0 + 9, 1 + 9]. 9 was set to 1.5 and 
0.75. In case of 9 = 1.5, the clusters are separated, whereas they are overlapped 
when 9 is set to 0.75. Moreover, three sizes of centers with respect to the ones of the 
spreads were considered by defining a parameter h having the values 1/2,1,2. This 
parameter means that the size of the spreads is h times that of the centers. 

In tables 1 and 2, the percentage of well- classified objects by the models are 
given. This is done by fixing one parameter at a time and averaging over the rest. 
So, the left columns of table 1 and 2 display the fixed parameters. Inasmuch as 
the cluster membership functions were known in advance, it is presumed that an 
object is assigned to a cluster correctly if the membership degree was the highest 
among all (u = 0.5). In addition, membership degrees higher than 0.75 and 0.9 
are reported so that the strength of our model can be evaluated . It can be seen 
that our model works better in most of the conditions. As a case in point, when 
9 = 1.5 and in conditions n = 10,50,100, k = 2,8,16, a, m — 2,3, our model 
had better performance for u = 0.5, 0.75, 0.9. When 9 = 0.75 and in conditions n = 
10, 50, 100, k — 2,8, 16, m — 2, 3, our model worked better, whereas in case (3 the 
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models proposed by Hathaway et al. and Yang and Liu, had the best performance. 
As reported in the tables 3,4, when 6 = 1.5, the average percentage of well-classified 
objects for our model is 97.39 (u = 0.5), 71.38 (u = 0.75) and 33.69 (u = 0.9). The 
model proposed by D'Urso and Giordini had the second highest performance after 
our model with 93.25 (u = 0.5), 60.81 (u = 0.75), 27.93 (u = 0.9). When 9 passed 
from 1.5 to 0.75, the average performance of all models got worse. In this case, 
the average percentage of well-classified objects for our model is 91.12 (u = 0.5), 
37.70 {u = 0.75) and 11.09 {u = 0.9) and for the model proposed by D'Urso and 
Giordini is 89.00 (u = 0.5), 32.56 (u = 0.75) and 7.35 (u = 0.9). As mentioned 
earlier, the simulation study showed that our model had much better results than 
the other existing models. 

6.2 Clustering fuzzy data with Outliers 

In order to evaluate how our model is able to detect the prototypes in case of possi- 
ble presence of observations that can be seen as outliers, we added some outliers to 
cases a and j3, mentioned above. After running several models for different values 
of q and u, we chose q = 2, u = 200. The other parameters for clustering algorithm 
were set as follows: Number of objects (n = 100,200,300), where n/10 of them are 
outliers and the rest of the them are inliers, number of variables k = (2, 8, 16) and 
the weighting exponent (m = 2). The modified cases a and (3 are as follows: 

• case a: The centers corresponding to the first 1/2 of inliers were generated 
from the uniform distribution in [0, 1], and those corresponding to the rest of 
the inliers from the uniform distribution in [1.5,2.5]. The number of outliers 
is n/10. The centers of outliers were generated from Normal distribution with 
mean=-2 and variance=2. The left and the right spreads were generated from 
the uniform distribution in [0, 1] 

• case [5: The left and the right spreads corresponding to the first 1/2 of inliers 
were generated from the uniform distribution in [0, 1], and those corresponding 
to the rest of the inliers from the uniform distribution in [1.5, 2.5]. The number 
of outliers is n/10. The left and the right spreads of outliers were generated 
from Normal distribution with mean=5 and variance=2. All the centers were 
generated from the uniform distribution in [0, 1]. 

The mean square errors (MSE) between prototypes obtained by performing our 
clustering model and the ideal prototypes are shown in tables 5 and 6. From these 
tables, it can be observed that MSE of the centers are more than MSE of left spreads 
and right spreads in case a, while MSE of spreads are higher than those of centers 
in case (3. In both cases, small weighting factors (large values for \) are assigned 

to data points fitting well to one of the clusters whereas large Uk (small values for 
4r) are assigned to outliers. Thus, outliers can be easily identified by their large 
weighting factors. 
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7 Conclusion and future works 

This paper presented a fuzzy clustering model for fuzzy data based on a new dis- 
tance. We have modified Keller's approach so that our model can be used in noisy 
environments. The weighting factors reduce the influence of outliers and enable us 
to identify them. Necessary conditions for the objective function to receive an opti- 
mum have been derived to calculate a partition of data. Also, in another approach, 
we transformed our distance to the Euclidean distance and reduced the problem of 
fuzzy clustering of fuzzy data to fuzzy clustering of crisp data. Finally, two simula- 
tion experiments were considered; one for comparing the performance of our model 
with those of other existing clustering models for fuzzy data and one for testing how 
well our model behaves in noisy environments. 

Our model can be applied in settings where the presence of outliers can drastically 
affect the results. An example is the process control problem in which the presence 
of outliers usually represents that the process has been out of control. 

Another problem that can be explored is to study in depth fuzzy clustering for 
interactive fuzzy data and determining the optimal weighting exponent (m). 
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Tabic 1: Percentages of well-classified objects with membership higher than u = 0.5, u = 0.75, u = 0.9 (6 = 1.5) 



Our model D'Urso and Giordani (2007) Hathaway et. al. (1996) Yang et. al. (2004) 

Yang and Liu (1999) 







u = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.! 


n = 


10 


97.53 


74.94 


38.14 


94.11 


66.22 


31.78 


94.78 


56.61 


19.11 


84.83 


54.94 


24.39 


n = 


50 


97.86 


70.14 


32.03 


94.18 


58.47 


26.14 


94.34 


48.36 


15.32 


85.97 


47.81 


19.74 


n = 


100 


97.68 


69.12 


31.16 


91.45 


57.74 


25.87 


92.88 


47.21 


14.82 


85.57 


46.27 


18.90 


k = 


2 


93.33 


73.17 


38.02 


84.36 


64.74 


31.08 


85.05 


53.29 


18.16 


82.18 


55.59 


24.33 


k = 


8 


98.95 


70.86 


31.89 


96.11 


59.95 


27.15 


98.26 


49.17 


15.67 


86.50 


47.80 


19.59 


k = 


16 


99.68 


70.21 


31.11 


99.27 


58.73 


25.56 


98.69 


49.71 


15.42 


87.69 


45.64 


19.11 


a 




100.00 


98.83 


52.63 


100.00 


97.30 


51.32 


93.36 


50.91 


16.47 


100.00 


87.29 


39.82 






94.78 


43.69 


15.33 


86.49 


24.32 


4.54 


94.64 


50.54 


16.37 


70.91 


12.06 


2.20 


m = 


: 2 


97.13 


81.64 


62.27 


93.12 


71.75 


53.56 


94.34 


69.79 


32.62 


85.50 


60.51 


40.90 


m = 


: 3 


97.49 


60.48 


4.43 


93.37 


49.87 


2.30 


93.66 


31.66 


0.21 


85.41 


38.84 


1.13 


h = 


1/2 


91.94 


55.20 


29.02 


91.96 


51.79 


25.44 


91.96 


51.31 


21.95 


76.28 


54.23 


27.59 


h = 


1 


99.72 


70.65 


29.88 


100.00 


74.77 


28.33 


99.94 


50.65 


5.29 


82.29 


52.24 


25.01 


h = 


2 


100.00 


88.96 


42.06 


87.78 


55.87 


30.01 


90.10 


50.21 


22.01 


97.79 


42.55 


10.43 
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Table 2: Percentages of well-classified objects with with membership higher than u — 0.5, u = 0.75, u = 0.9 (9 = 0.75) 



Our model D'Urso and Giordani (2007) Hathaway et. al. (1996) Yang et. al. (2004) 

Yang and Liu (1999) 





it = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.9 


u = 0.5 


u = 0.75 


u = 0.! 


n = 10 


89.86 


47.72 


17.47 


89.78 


39.89 


12.39 


88.72 


32.28 


6.06 


81.89 


37.39 


10.33 


n = 50 


91.34 


35.28 


8.23 


89.43 


29.17 


5.07 


87.87 


20.81 


1.73 


79.39 


28.37 


4.71 


n = 100 


92.26 


32.08 


7.19 


87.78 


28.86 


4.58 


86.27 


19.91 


1.61 


80.20 


27.63 


4.76 


k = 2 


82.82 


44.91 


18.49 


81.40 


41.03 


13.75 


78.98 


32.25 


7.67 


74.43 


39.50 


11.97 


k = 8 


93.25 


35.11 


8.96 


88.95 


29.18 


4.99 


89.11 


21.31 


1.12 


82.85 


27.67 


4.41 


A: = 16 


97.17 


32.49 


6.53 


96.64 


27.71 


3.29 


94.78 


19.43 


0.61 


84.20 


26.22 


3.44 


a 


98.52 


60.97 


18.29 


98.97 


55.20 


12.57 


86.33 


24.77 


2.93 


98.99 


54.97 


11.39 




83.32 


14.01 


3.15 


79.02 


10.07 


2.12 


88.91 


23.89 


3.34 


61.99 


7.28 


1.81 


m = 2 


90.58 


59.62 


20.41 


89.14 


55.50 


14.44 


87.89 


43.68 


6.18 


80.64 


50.71 


12.95 


m = 3 


91.94 


16.16 


1.95 


88.86 


9.77 


0.25 


87.35 


4.99 


0.09 


80.34 


11.54 


0.25 


ft = 1/2 


83.88 


38.21 


14.67 


84.75 


29.75 


4.04 


84.75 


29.75 


4.04 


75.21 


38.71 


11.62 


ft = l 


91.75 


35.84 


11.44 


97.62 


31.66 


5.29 


96.87 


14.71 


1.22 


75.90 


31.06 


6.22 


h = 2 


97.94 


38.60 


7.74 


84.63 


36.51 


12.71 


81.24 


28.53 


4.13 


90.37 


23.61 


1.97 
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Table 3: Average percentage of well-classified objects (6 — 1.5) 



u = 0.5 


u = 0.75 


u = 0.9 


Our model 97.39 


71.38 


33.69 


D'Urso and Giordani 93.25 


60.81 


27.93 


Hathaway et.al. 94.00 


50.72 


16.42 


Yang et.al. 85.46 


49.67 


21.01 


Table 4: Average percentage of well-classified objects (0 = 0.75) 


u = 0.5 


u = 0.75 


u = 0.9 


Our model 91.12 


37.70 


11.09 


D'Urso and Giordani 89.00 


32.56 


7.35 


Hathaway et.al. 87.62 


24.33 


3.13 


Yang et.al. 80.49 


31.13 


6.60 



Table 5: Mean Square Error (MSE) for cluster prototypes - case(a)(with outliers) 







Centers (MSE) 


Left Spreads (MSE) 


Right Spreads (MSE) 


n = 


100 


0.0035 


0.0015 




0.0016 


n = 


200 


0.0025 


0.0007 




0.0007 


n = 


300 


0.0022 


0.0005 




0.0005 


k = 


2 


0.0025 


0.0014 




0.0013 


k = 


8 


0.0024 


0.0007 




0.0007 


k = 


16 


0.0034 


0.0006 




0.0006 


Table 


6: M. 


oan Square Error 


(MSE) for cluster prototypes - i 


case(/3)(with outliers) 






Centers (MSE) 


Left Spreads (MSE) 


Right Spreads (MSE) 


n = 


100 


0.0031 


0.1365 




0.1363 


n = 


200 


0.0014 


0.1447 




0.1441 


n = 


300 


0.0010 


0.1481 




0.1482 


k = 


2 


0.0033 


0.0140 




0.0140 


k = 


8 


0.0010 


0.1232 




0.1227 


k = 


16 


0.0010 


0.2937 




0.2932 
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